# Multilingual speech synthesis

OuteTTS 1.0 0.6B GGUF

OuteTTS-1.0-0.6B GGUF is a multilingual text-to-speech model that supports speech synthesis and voice cloning, providing efficient and accurate speech generation.

Speech Synthesis Supports Multiple Languages

Llama OuteTTS 1.0 1B

OuteTTS 1.0 is a multilingual text-to-speech model based on the Llama architecture, supporting 20 languages with high-quality speech synthesis and voice cloning capabilities.

Speech Synthesis Supports Multiple Languages

Llama OuteTTS 1.0 1B GPTQ 8bit

OuteTTS 1.0 is a 1B-parameter text-to-speech model supporting multilingual speech synthesis and voice cloning.

Speech Synthesis Supports Multiple Languages

Voila Autonomous Preview

Voila is a large family of speech-language foundation models designed to enhance human-computer interaction, supporting real-time, low-latency voice interaction and multilingual processing.

Transformers Supports Multiple Languages

Voila Audio Alpha

Voila is a large family of speech-language foundation models designed to enhance human-computer interaction, supporting real-time, low-latency voice interaction and multilingual processing.

Transformers Supports Multiple Languages

Voila is a brand-new large-scale speech-language foundation model series designed to elevate human-computer interaction to unprecedented levels.

Transformers Supports Multiple Languages

Voila is a brand-new family of large-scale speech-language foundation models designed to elevate human-computer interaction to new heights.

Speech Recognition

Transformers Supports Multiple Languages

Voila Tokenizer

Voila is a family of large-scale speech-language foundation models designed to enhance human-computer interaction, supporting multiple audio tasks and languages.

Transformers Supports Multiple Languages

XTTS V2 Argentinian Spanish

ⓍTTS is a voice generation model that can clone a voice with just 6 seconds of audio and apply it to different languages, supporting Argentinian-accented Spanish.

Speech Synthesis Spanish

XTTS V2 Argentinian Spanish

ⓍTTS is a speech generation model that can clone a voice from just 6 seconds of audio and apply it to different languages, with no need for hours of training data.

Speech Synthesis Spanish

Latvian text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis.

Speech Synthesis

MMS TTS Uzb Script Cyrillic

Uzbek (Cyrillic script) text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis.

Speech Synthesis

MMS TTS Urd Script Devanagari

Urdu text-to-speech model developed by Meta that takes text transliterated into Devanagari script as input and generates high-quality speech.

Speech Synthesis

SpeechT5 TTS Haitian

A Haitian Creole text-to-speech model based on the SpeechT5 architecture, fine-tuned on Carnegie Mellon University's Haitian language dataset.

Speech Synthesis

Transformers Other

Bark is a Transformer-based text-to-audio model created by Suno, capable of generating highly realistic multilingual speech, music, background noise, and simple sound effects.

Speech Synthesis

Transformers Supports Multiple Languages

TorToiSe is a text-to-speech program focused on strong multi-voice capabilities and highly realistic prosody and intonation.

Speech Synthesis

Transformers English

© 2025 AIbase